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Abstract. In this paper we study random induced subgraphs of Cayley graphs of the symmetric 
group induced by an arbitrary minimal generating set of transpositions. A random induced sub- 
graph of this Cayley graph is obtained by selecting permutations with independent probability, 
A n . Our main result is that for any minimal generating set of transpositions, for probabilities 
= ^—l wnere n ~3~^" s < e n < 1 and 8 > 0, a random induced subgraph has a.s. a unique 
largest component of size p(e n ) 1 n t lf" nl, where p(e n ) is the survival probability of a specific 
branching process. 



1. Introduction 



One central problem arising in parallel computing is to determine an optimal linkage of a given 
collection of processors. A particular class of processor linkages with point-to-point communication 
links arc static interconnection networks. The latter arc widely used for message-passing architec- 
tures. A static interconnection network can be represented as a graph. The binary n-cubes, Q2, 
[U [27] are a particularly well-studied class of interconnection networks [lOl H2J 021 [32] • 

Akers et al. [2] recognized the deficiencies of n-cubes as models for interconnection networks and 
proposed an alternative: the Cayley graph of the permutation group induced by the (n — 1) star- 
transpositions (li), which we denote by T(S n , P n ). Pak |28| studied minimal decompositions of 
a particular permutation via star-transpositions and Irving et al. |20] extended his results. The 
star-graph T(S n , P n ) is in many aspects superior to n-cubes [l][27]. Some properties of star-graphs 



Date: September, 2009. 

2000 Mathematics Subject Classification. 05A16. 

Key words and phrases, random graph, permutation, transposition, giant component, vertex boundary. 



2 



EMMA Y. JIN AND CHRISTIAN M. REIDYS * 



studied in [TH [TH] HH HZ] [5T] US] were cycle-embeddings and path-embeddings. Diameter and fault 
diameter of star-graphs were computed by Akers et al. [2j [24] [31] and Lin et at [26] analyzed 
diagnosability. An alternative to n-cubes as interconnection networks are the bubble-sort graphs 
[3J , studied by Tchuente [33J . The bubble-sort graph is the Cayley graph of the permutation group 
induced by all n — 1 canonical transpositions (i i + 1), denoted by T(S n , B n ). 

Recently, Araki [5] brought the attention to a generalization of star- and bubble-sort graphs, the 
Cayley graph generated by all transpositions. The latter has direct connections to a problem of in- 
terest in computational biology: the evolutionary distances between species based on their genome 
order in the Cayley graph of signed permutations generated by reversals. A reversal is a special 
permutation that acts by flipping the order as well as the signs of a segment of genes. Hannenhalli 
and Pevzner [29j presented an algorithm computing minimal number of reversals needed to trans- 
form one sequence of distinct genes into a given signed permutation. For distant genomes, however, 
it is well-known, that the true evolutionary distance is generally much greater than the shortest 
distance [55] . In order to obtain a more realistic estimate of the true evolutionary distance, the 
expected reversal distance was shifted into focus. Its computation, however, has proved to be hard 
and motivated to consider models better suited for computation. Point in case is the work of Erik- 
sen et al. [llj . where the authors derive a closed formula for the expected transposition distance 
and subsequently show how to use it as an approximation of the expected reversal distance. 

In this paper we study a subgraph of the Cayley graph generated by all transpositions, the Cayley 
graph r(5 , „,T n ), where T n is a minimal generating set of transpositions. Setting T n = P n and 
T n = B n we can recover the star- and the bubble-sort graph as particular instances. We study 
structural properties of T(S n , T n ) in terms of the random graph obtained by selecting permutations 
with independent probability. The main result of this paper is 

Theorem 1. Let X n = 1 ~ t ^ t , where n~i +s < e n < 1 and 6 > 0. Let T n be a minimal generating set 
of transpositions and let T n = r„(A„) denote the random induced subgraph of T{S ni T n ), obtained 
by independently selecting each permutation with probability X n . Then T n has a.s. a unique giant 
component, Cn \ whose size is given by 

(1-1) \C^~(l-o(l))-p(e n )-±±^.n\, 

n — 1 

where p(e„) = (1 + o(l))x for e n = e > ; x > being the unique root of e~( 1+e ) y = 1 — y and 
p(e n ) = (2 + o(l))e n for < e n = o(l). 
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FIGURE 1 . The evolution of the giant component in random induced subgraphs of 
r(S l g, Pg). We display the relative size of the giant component | r " | as a function 
of (1 + e)/(n — 1) as data-curve (blue) versus the growth predicted by Theorem [1] 
(red). 



Similar to the case of random induced subgraphs of n-cubes |30j . random induced subgraphs of 
T(S n ,T n ) exhibit a giant component for very small vertex selection probabilities. This is insofar 
interesting as important relations, like for instance, the isoperimctric inequality [14] . do not play 
a key role. 

The paper is organized as follows: after establishing in Section [2] some basic properties of T(S n , T n ) 
and preparing all necessary background, we analyze in Section [3] vertices contained in polynomial 
size subcomponents. The strategy is similar to that in [30] . where first a specific branching process 
is embedded (for its first Li n ^J steps) into T(S n ,T n ). It is its survival probability that provides 
a lower bound on the probability that a given vertex is contained in a subcomponent of arbitrary, 
polynomial size. In Section [¥] we "sandwich" this bound by showing that there are many vertices 
in "small" components. Only here we use e < 1. In Section [5] we show that there are many vertex 
disjoint paths between certain splits of permutations. The a.s. existence of the giant component 
follows using the ideas of Ajtai et al. pQ. 
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2. Some basic facts 

Let S n denote the symmetric group over [n]. We write a permutation 7r G S n as an n-tuple 
(x\, . . . , x n ). Let T n C S n be a minimal generating set of transpositions. Plainly, we have for i < j 

(2.1) (xi , . . . , , . . . , Xj , . . . , X n ) (i j) — (x\ , . . . , Xj" j . . . , Xi , . . . , x n ) . 

We consider the Cayley graph T(S n ,T„), having vertex set S n and edges {v,v') where v~ l v' G T n . 
For G S n , let d(v,v') be the minimal number of X^-transpositions by which v and u' differ. 
For A C 5 n we set B(A, j) = {v G 5 n | 3a G A; d(u,a) < j} and 6(A) = {v € S n \A \ 3a £ 
A; d(v, a) = 1} and call B(A, j) and 6(A) the ball of radius j around A and the vertex boundary 
of A in T(5„, T„). If A = {a} wc simply write B(a,j). Let D,E a S n , we call D £-dense in E if 
B(cr, £) n D 7^ for any <r G E. Let "<" be the following linear order over F(S n ,T n ) 

(2.2) a < t ^==> (d(a,id) < d(r,id)) V (d(a, id) = d(r, id) A a <i cx r), 

where id is the identity permutation and <i ox denotes the lexicographical order. Any notion of 
minimal or smallest element in a subset A G S n refers to the linear order < of eq. (|2.2p . 

Let T\ 7i (S n ,T n ) be the probability space (random graph) consisting of T(S n , T„)-subgraphs, r„, 
induced by selecting each T(S n , T„)-vertex with independent probability A„. A property M is a 
subset of induced subgraphs of T(S n , T n ) closed under graph isomorphisms. The terminology "M 
holds a.s." is equivalent to linin^oo Prob(M) = 1. A component of T n is a maximal, connected, 
induced r„-subgraph, C n . The largest r„-component is denoted by cli \ Wc write x n ~ y n if and 
only if (a) lmin^oo x n /y n exists and (b) lim n _, 00 x n /y n — 1. We furthermore write g(n) = 0(f(n)) 
and g(n) = o(f(n)) for g(n)/f(n) —> n &s n —> oo and g(n)/f(n) — > as n — > oo, respectively. A 
largest component is called giant if it is unique in "size", i.e. any other component, C n , satisfies 
\C n \=o(\d 1] \). 

Let Z n = Yli—i Ci be a sum of mutually independent indicator random variables (r.v.), £j having 
values in {0, 1}. Then we have, [5], for rj > and c n = min{- ln(e''[l + i]}^^ 1+ ^), ^ } 

(2.3) Prob( | Z n - E[Z n ] \ > V E[Z n ] ) < 2e~ c ^ z ^ . 

n is always assumed to be sufficiently large and e is a positive constant satisfying < e < 1. We 
write B m (£,X n ) = (" ( l )\ e n (1 - X n ) m - £ . 
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Let us next recall some basic facts about branching processes, T„, = 7 m (p) [E1I23]- Suppose T m 
is initiated at £. Let (Q ), i,t £ N count the number of "offspring" of the ith- "individual" of 
"generation" (t — 1), where the r.v. £ and are £? m (^,p)-distributed. Let To = 7o(p) denote the 
branching process for which £ is B m (£,p)- and all are £? m -i(^,p)-distributed. Furthermore, let 
3 3 p(A), (A > 0) denote the branching process in which the individuals generate offsprings according 
to the poisson distribution, i.e., P(£j = j) = jre~ x . We consider the family of r.v. (Z;); s n : 

Zq = 1 and Z t = 5Jj=i d f° r ^ — 1 an d interpret Z t as the number of individuals "alive" in 
generation t. Of particular interest for us will be the limit lim^oo ¥(Z t > 0), i.e. the probability 
of infinite survival. We write 

MP) = lim MZt > 0), n m (p) = lim P m {Z t > 0) and tt p (A) = lim ¥ P {Z t > 0) 

t — >oo t — >oo t — >00 

for the survival probability of 3>o, T m and J'p(A), respectively. 

Corollary 1. [8] Let p = X/n. 

(1) If X> 1 is fixed, then n (p) = (1 + o(l))7r P (A). 

(2) Let A n = 1 + f-n, where < e„ = o(l). Then, if r = n — s and s — o(ne n ), 

MP) = (1 + o(l)W(p) - (2 + o(l))e„. 

3. Components of size at least 0(ni +kS ) 

Suppose x > is the unique root of e~( 1+e ^ v = 1 — y and 
(3.1) p(e») = 

For k G N we set 



(l + o(l))x for e„ = e > 
(2 + o(l))e„ for < e n = o(l). 



(3.2) /x„ = L 2fc ^ fc 1 _ | _ jT n °J' £ " = L 2 (fc fc |_ i) n ^' and z n = n - kfi n - e n . 
We next establish some basic properties of the Cayley graph T(S n ,T n ): 

Lemma 1. Let T n be a minimal generating set of S n consisting of transpositions, then we have 

(1) T n has cardinality n — 1 and corresponds uniquely to a labeled tree, 7 n over [n]. 

(2) there exists a sequence {vi)2<i such that T n = {(i>j s,) | 2 < i < n} and 

(3.3) V .7 < z; ((sci, . . . ,x n )(vj Sj)) Vi ^ ((xi, . . . ,x n )(vi 
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(3) the diameter of T(S n ,T n ) is given by 

n 

(3.4) diam(T(S„, T n )) < ^ diam(T ? ) < 

i=2 

Proof. It is straightforward to prove by induction that \T n \ = n — 1. We next consider the graph 
T„ over [n], having edge-set T„. Since (T n ) = S n , T n is connected and since T n is independent, T„ 
is a tree. This establishes the mapping 

ip: {T n | T n is a maximal independent transposition set} — > {T„ | T n is a tree over [n]}. 

Furthermore, ?/> has an inverse; as the edges of a tree over [n] give rise to a maximal independent 
set of transpositions that generate S n , whence assertion (1). 

In order to prove (2), we generate the tree T n inductively as follows: we start with vertex 1 and 
consider its unique minimal clement in distance 1, Vi, and set Ti = {{vi 1)}. Given Tj, we consider 
the transposition (i>;+i Sj+i), where Uj+i is the unique minimal element contained in T n \0*j, having 
minimal distance to 1, and Sj+i is its unique T^-ncighbor. We then set Tj+i = T, U {(Wj+i Si+i)}. 
This process gives rise to the sequence of trees T2 C T3 C • ■ ■ C T„ and denoting the vertex sets 
of Tj by y i; we have Vi = {1} C V 2 C F 3 C . . . K-i C K = [n] where = Vi \ V5_i. By 
construction 

V j < i; ((xi,...,x n )(vj Sj)) Vi ^ ((xi,...,x n )(viSi)) Vi . 

To prove (3) we can, without loss of generality, restrict ourselves to the case where we have an 
arbitrary permutation (x±, . . . , x n ) and (j/i, . . . , y n ), the unique permutation satisfying y Vi = i. We 
proceed by constructing a T(S n , T n )-path between these two permutations. Obviously, there exists 
an unique Vj such that n = x Vj and in the tree 7 n there exists a unique path of length at most 
diam(T„) < n — 1 connecting m and v n . Accordingly, there is a T(S n , T„)-path of length at most 
diam(T„) between (xi) and a permutation (z,) such that z Vn = n. Our construction in (2) implies 

Vi<n; ((zi, . . . , z n )(vi s l )) Vn = n, 

whence we can proceed inductively, moving (n— 1) to the w„_ith position using the subtree 7 n —\. 
We consequently arrive at 

n 

diam(r(5„,r„)) < ^diam(T ? ) < 

i=2 

and the proof of the lemma is complete. □ 
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In case of star-transpositions, i.e. T n = {(lj) | 2 < j < n}, we have the following situation: 

(3.5) {1} c {(1 2)} c {(1 2), (1 3)} c • • • C {(1 j) | 2 < j < n}, 

{vi Sj) = (£+11) i.e. Si = 1 and diam(r(5 , n , {(1 j) | 2 < j < n})) < 2(n — 2). Similarly, in case of 
the bubble-sort graph, T n = {(ii + 1) | 1 < i < n — 1} 

(3.6) {1} c {(12)} c {(12), (23)} c ■ - - C {(** + 1) | 1 < * < n - 1} 
we arrive at (vi Si) = (i + 1 i) and diam(r(S'„, {(i t+l)|l<i<n — 1})) = Q) ■ 

Lemma 2. Suppose T n is a minimal generating set of transpositions. We select permutations with 
independent probability A„ = , where n~^ +s < e n , for some S > 0. Then each permutation, 
v, is contained in a T n -subtree 7 n (v) of size \_\n^ \ with probability at least p(e n ). 

Proof. We construct the subtree 7 n (v) by means of a branching process [TS] within T(S n ,T n ). 
Without loss of generality, we may initiate the process at id and assume (for notational convenience) 
n — € N. We shall begin by specifying an appropriate move-set (of transpositions) by which 
the offspring of the branching process is being generated. To this end, let 

N = {( Vj Sj ) | 1 < j < n - in* - 1} C T n . 

Note that N acts trivially on labels Vh where h > n— hni — 1. 

The process is defined as follows: we set Uq = C N and Mo = Lq = {id} C S n . At step (j + 1), 
suppose we are given Uj C N, Mj and Lj C S n . In case of Lj = or \Uj\ = [\ni \ — 1 the 
process stops. Otherwise, we consider the smallest element lj G Lj and select among its smallest 
(n — Lf" 2 ^ 3 ] — 1) neighbors, contained in N \ Uj with independent probability A„. Let x\ = lj r Xl 
be the first selected Zj-neighbor and r Xl £ N \ Uj. We then set Uj(xi) = UjU{r Xl } and proceed 
the selection with the smallest (n— L| n2 ^ 3 J — 1) neighbors contained in N\Uj(xi) instead of those 
in N \ Uj. After all lj neighbors are checked and given that (xi, . . . ,x s ) have been subsequently 
selected, we set 

Uj+i = UjU{r Xl ,...,r Xs } 
L j+ i = (Lj\{lj})U{xi,...,x s } 
M i+ i = MjU{x u ...,x s }. 

The minimality of T n and the fact that each T„-elemcnt is used at most once implies that this 
process generates a tree, i.e. each Mj + i -element is considered only once. Furthermore, in view of 

(3.7, I±^. („- L ?„lj-l)>l 
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this process can be viewed as a branching process embedded in T(S n ,T n ). Therefore 

1 2 \ 
Mj\ = L^« 3 J I for some j J > p(e n ) 

and the lemma follows. □ 



By choosing k sufficiently large, we next enlarge the Lemma[2]-subtrees via an inductive construction 
to arbitrary polynomial size. We remark that Lemma[T]is of central importance for the construction 
of the Lemma [3]-subtree, since some T„-generators will be use multiple times. 

Lemma 3. Suppose k is arbitrary but fixed and let 8 n > 0(n s ). Then each T n -vertex is contained 
in a r n - subcomponent of size > Ckfi^O^ > Cfcni+ fe5 with probability at least 

(3.8) S k (e n ) = p(e n )(\-e-^), 

where (3k > and e n > n~i +s . 



Proof. Without loss of generality we may assume tt = id, fx n , £ n £ N and set 

A m = {(v™sf)eT n \l< 3 <fi n }. 

where {vf sf) = (« z „ +J - + ( m _i) jL(n _i s, n+J - + ( m _ 1 ) Atn _ 1 ). That is, A m is the "first" (in the sense of 
the labeling given by the sequence (^2,^3, ...,«„)) subset of T„-transpositions that act on labels 
Vi, where i < z n + m/i n — 1 for 1 < m < k. 

We set w\' = s^) and consider the branching process of Lemma [2] at n = id, denoting the tree 
of size [\ni\ by T 1 . Let 

{wV e A 1 I 3x G T 1 ; a; • 10^ € r n } . 



According to Lemma [T] 



Vx,y eT^VwP eAy, x-wf ] ^yw£\ 



whence 



(3.9) E[Fi] =//„•!-!- 



1 + e» 

71 - 1 



H„ ( l-exp(-(l + e„)-n 3) 



Using large deviation inequalities [9] , we conclude that there exists some constant c\ > such that 



Fr < -E[Ki] ) < exp (- Cl • E[y x ]) 
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We select the smallest element, x^^, from the set {a; • | x G T x ,x ■ G r n } and start the 
branching process of Lemma[2]at x^jy As a result, we derive the tree C2(xuj)) of size |_i n ^J with 
probability at least p(e„). According to Lemma [21 the generation of this tree exclusively involves 
labels Vj where j < z n — 1. Therefore, since any two smallest elements xr^j^ and £(i 2 j 2 ) differ in 
at least one of two coordinates with labels Vj 1 ,Vj 2 for z n < < z n + fj, n , we have 

C, 2(i(i 1 j 1 ))nc 2 (i(i jjj )) = 0. 

Let Xi be the r.v. counting the number of these new r„-subcomponents. In view of eq. p.9p . we 
obtain 

E[*i] > p(e„) • E[Yi] ~ p(e n ) ■ ,i„ (\ - exp(-(l + e^n"*)) = 9 n > 0(n s ). 
Again, using large deviation inequalities we conclude that there exists some c\ > such that 

¥(x 1 < ^e n ) < cx P (- Cl e n ) 

or equivalcntly, since the union of all the C^x^j^-subcomponents with T 1 forms a T(S n ,T n )- 
subtrce, T 2 , we have 

(3.10) P (jT 2 | < L|« 2/3 J • \e^j < exp(-ci^). 

We now proceed by induction: 

Claim: For each 1 < i < k there exists some constant Cj > and a T(S n , T„)-subtree T l such that 

p(in< L i^/3j.i_^)< cxp (_ c ^ n ). 

We have already established the induction basis. As for the induction step, let us assume the 
claim holds for i < k and let Ci(a) denote a subcomponent generated by the branching process of 
Lemma[2]in the z-th step. We consider the T n -transpositions uif' +1 ' G A i+ i. We consider 

the minimal elements, x" of 

Y l+1 = G A+1 I 3x G C t {a);x- G r„} 

at which we initiate the branching process of Lcmma[2] The process generates tree subcomponents 
Ci+i(x") of size Li 71 ^] with probability > p(e„). Any two of these are mutually disjoint and let 
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Xi + \ be the r.v. counting their number. We derive setting q n = [jn 2 ^ 3 \ 

\t 1+1 \ < Qn^ett 1 ) < p (\r\ < q n ±ei\ +p (V +1 I < gn-^e 1 1 n > <?„^ 



failure at step i failure at step i + 1 conditional to |T ; | > g„i9j, 

< + ^ .(i_ e -**») f Cl>0 

induction hypothesis large deviation results 

< e -c i+ i0„ 

and the claim follows. 

Therefore each r„-vertex is contained in a tree-subcomponent of size at least 0{nie k ) > 0{ni +k5 ), 
with probability at least p(e n )(l — exp(— CkO n )) and the lemma is proved. □ 



4. Vertices in small components 

Let T n _k denote the set of r„-vertices contained in components of size > 0{n kS+ i) for sonic 
< 5 < 1. In this section we prove that |r„^| is a.s. ~ p(e n ) n!. In analogy to Lemma 3 of 
[30] we first observe that the number of vertices, contained in r„-components of size < Ckn kS+ i, 
is sharply concentrated. The concentration reduces the problem to a computation of expectation 
values. It follows from considering the indicator r.vs. of pairs (C, v) where C is a component and 
v G C and to estimate their correlation. Since the components in question are small, no "critical" 
correlation terms arise. 

Lemma 4. Let ui n = \T n \ T n! k\ and X n = ^zf, where n~i +5 < e n < A, for some A > 0. Then 
we have 

(4.1) P M uj n - E[w„] | > ^EMJ = o(l). 



With the help of Lemma U we proceed by computing the size of T n ^- 

Lemma 5. Let \ n = ^zf, where n~i +s < e„ < 1 and suppose k G N is sufficiently large. Then 

(4.2) |r n , fc | ~ p(e n )^±^n! a.s.. 

n — 1 
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Proof. First we prove for any n z +s < e n < A, where A > 
(4.3) (l-o(l))p(e n )|r n | <|r niJfc | 

By assumption we have 



W < (1 - S k (e n ))\T n \. 



In view of Lemma |4l we derive 



u n < I 1 + O(-) ) E[u}„] < ( 1 - S k (e n ) + O(-) ) |r„| a.s 



whence 



|r„, fe | > (s k (e n ) - 0(^)\ \T n \ = (1 - o(l))p(e„)|r n | a.s.. 



Next we prove for n s + 5 < e n < 1 



(4.4) 



|r„, fc | < (i + (i))p(e n )|r„| 



For this purpose we consider the branching process on a (n — l)-regular rooted tree T r * where the 
r.v. ^* of the rooted vertex r* is B{n — 1, A rl ) distributed while the r.v. of any other vertex r 
has the distribution B{n — 2, A„). Let C r » denote the component generated by such a branching 
process. Bollobas et al. [8] showed that 



(4.5) 



P(|C r . | = *) = (! + o(l)) 



(X n -(n-2)Y 



(n-2)(l-A n ) 
(n-3) 



where i = i(n) — > oo as n — » oo. The key observation is an inequality due to Bollobas aZ. [S], 
relating this process with the construction of a spanning tree of a r„-component at vertex r, 



(4.6) 



P(|CV*| < m) <P(|C r | < m). 



Eq. (|4.6p follows immediately from the observation that during the generation of a spanning tree 
of a component, there are for each vertex at most (n— 2) neighbors, while in T r * there exist exactly 



(n— 2) neighbors. Suppose now k is sufficiently large, satisfying kS + | > 3| . Then n 2 < c k ■ 
for sufficiently large n, i.e. 

(4.7) P(c k ■ n k5+ % < \C r * | < oo) < P(n 2 < |C r « | < oo). 



-.kS+i 
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This probability can be estimated as follows 

F(n 2 < |C r * | < oo) = Yj V (\ Cr * I = 



i>n 2 



< E( 1 +°( 1 )) 



i>n 2 



(An ■ (n - 2) ) 
i\/2m 



i-i r 



(n -2)(l- A n ) 
(n-3) 



7ii — 3i+2 



< £ [(l + ^e" 6 -]' < £ c(e)' = (e- ln W"), 

i>n 2 i>n 2 



where < c(e) < 1. We accordingly derive 



(4.8) 
(4.9) 



a* | < c fc = P(|a*| < oo) -P( Cfc -n* 4+ ' < |C r .| < oo) 

> (l-(l + (l))p( £n ))- (e-" ln( ' l) "), 



where ttq denotes the survival probability of the branching process on T r * , see Corollary [T] From 
eq. (|4.6|) and eq. (|4.9|) we immediately obtain, taking the expectation 



(4.10) E[w n ] > (l-p(e„))|r„|+o(l) 

Lemma |4] accordingly implies 
(4.11) 

whence the lemma. 



(l-p(e„)-0(-))|r n | <u n a.s. 
n 



□ 



We established in proof of Lemma [5] the relation 

P(n 2 < I < oo) < J2 + e «) e " e "] 1 ^ E c ( £ ) 4 = °( e_ '"^ ")• 

i>n 2 i>n 2 

Therefore, relaxing the condition e„ < 1, we loose the upper but not the lower bound. That is, we 
have for probabilities A„ = -^r where A > 

Corollary 2. Let X n = — where < A and suppose k € N is sufficiently large. Then 
(4.12) |r„, fc | > (l-o(l))7T P (A)— ^-rn! a.s.. 
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5. The main theorem 



We show in this section that the unique giant component forms within r n k for two reasons: first, 
any r^fc-vertex is a priori contained in a subcomponent of size > 0{n + ^) 1 limiting the number of 
ways by which r^-splits can be chosen and second there are many independent paths connecting 
large T(S n , T n )-subsets. We first prove Lemma |6] according to which T n .k is "almost" 2-dense in 
T(S n ,T n ). 

Lemma 6. Let k E N, A n = ^rf where e n > n~i +5 for some 5 > and let furthermore 
As = {v | \d(v,2) H r„.fc| < Ck ■ n s }. Then we have for some k > 0, ¥(v G As) < cxp(— nn s ) and 
there exists some < p < k such that 

\As\ < nl exp(— pn s ) a.s.. 



Proof. We consider now the action of the transpositions 



A k+1 = a * +1 ) e T„ 1 1 < i < i n ) 



where w 



(fe+i) 



,fc+i s k+i 



3 ) = {v Zn -i + j + k^ n Sz„-i+ 3 +kp, n ) and set 

d (k+1 \v,2) = {v ■ wf +1) ■ w {k+1) \l <Kj< i n }. 



We proceed by establishing a lower bound on the cardinality of d^ k+1 \v, 2). Since T„ is a minimal 
generating set, any sequence of distinct T„-transpositions is acyclic. Therefore 

j l 2 



M (fc+1) (t>,2)| > 



1 ft. 



2! V 2 



(2!) 



M- 1 



2(k + 1) 



(l-o(l)). 



Let A k 



2(fc+l) 



/(2!) 2 and Z be the r.v. counting the number of vertices contained in the set 

d^ k+1 '(v, 2) n Tn t k, whose subcomponents are the trees constructed in Lemma[3] We immediately 
compute 

HZ) > X n ■ 8 k (e n ) ■ \S k +V(v, 2)| ~ A k n* • ■ p(e n )(l - e"^") > A k ■ n 5 . 

n — 1 

The key observation is the following: the construction of the Lemma [3]-subtrees did not involve 
any labels w z „-i+j+fcp„ , i.e. any two such subtrees remain vertex-disjoint. Therefore the r.v. Z is 
a sum of independent indicator r.vs. and Chernoff's large deviation inequality [5] implies 

(5.1) 



(v e A s ) < P ( Z < - A k ■ n° ) < exp(-nn a ) for some k > 0. 
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Consequently, the expected number of vertices contained in Ag is bounded by n\ exp(— nn s ). Now 
Markov's inequality F(X > tE(X)) < 1/t, for t > 0, guarantees \Ag\ < n\ ■ e~ pn5 a.s. for any 
< p < k and the lemma follows. □ 



Next we show that there exist many vertex disjoint paths between r^-splits of sufficiently large 
size. The proof is analogous to Lemma 7 in [30] . We remark that Lemma [7] does not use an 
isoperimetric inequality [14] . It only employs a generic estimate of the vertex boundary in Cayley 
graphs due to Aldous [H [6] . 

Lemma 7. Let (S, T) be a vertex-split of Y n ^ with the properties 

(5.2) 30<po<Pi<l; (n-2)! < \S\ = Po \T nM \ and (n - 2)! < |r| = pi|r„ )fc |. 
Then there exists some c > such that a.s. d(S) is connected to d(T) in r(S'„,T n ) via at least 

(5.3) c(ti-5)!/(?i-1) 7 
vertex disjoint (independent) paths of length < 3. 



Proof. We distinguish the cases |B(5,2)| < f n! and |B(5,2)| > In the former case, we employ 
the generic estimate of vertex boundaries in Cayley graphs [3] 

In view of eq. f|5 . 2|) and Lemma [J eq. (|5.4p implies 

(5.5) 3d 1 >0; |d(B(5, 2))| > ^- |B(S, 2)| >d!-(n- 4)!. 

n 

According to Lemma [BJ a.s. all but < n\ e~^ n permutations are within distance 2 to some r„ fc- 
vertex, whence 

(5.6) |d(B(S,2))nB(T,2)|>d 2 -(ii-4)! a.s.. 

Let (3 2 G d(B(5, 2)) n B(T, 2). Then there exists a path (ai,a 2 ,/3 2 ) such that a x € d(S), a 2 G 
d(B(S, 1)). We distinguish the cases 

(5.7) |d(B(S,2))nd(B(T,l))|>da,i(n-4)! and |d(B(S, 2)) n B(T, 1)| > d 2 , 2 (n - 4)!. 
For |d(B(5, 2)) n d(B(T, 1))| > d 2 ,i (n - 4)!, we consider the set 

T* = G d(T) I d(Jh,frt) = IJor some (3 2 G d(B(T, 1))}. 
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Evidently, at most n — 1 elements in d(T) can be connected to a fixed /3 2 , whence 

|T*|>id 2il (n-5)!. 

Let T\ C T* be some maximal set such that any pair of Ti-vertices ((3i,f3[) has at least distance 
diPi,^) > 6. Then |T X | > |T*|/(n-l) 7 since |B(u,7)| < (n~l) 7 . Any two of the paths from d (S) 
to T\ C d(T) are of the form (ai, a 2 , /32, and vertex disjoint since each of them is contained in 
T(/3i,3). Accordingly there are a.s. at least 

(5.8) ^d 2A (n-5)\/(n-l) 7 

vertex disjoint paths connecting d(S) and d(T). In case of |d(B(S, 2)) n B(T, 1)| > c? 2 ,2 ( n ~ 3)! we 
analogously conclude, that there exist a.s. at least 

(5.9) d 2a (n-A)l/(n-lf 

vertex disjoint paths of the form (a x , a 2 , /3 2 ) connecting d(5 l ) and d(T). 

It remains to consider the case \B(S, 2)| > | • nl. By construction both: S and T satisfy eq. (|5.2p . 
whence we can, without loss of generality assume that also |B(5, 2)| > | • n! holds. But then 

|B(5,2)nB(T,2)| > in! 

and for each a 2 S B(5, 2) n B(T, 2) we select ot\ e d(i5) and /3i G d(T). We derive in analogy to 
the previous arguments that there exist a.s. at least 

(5.10) d 2 (™-3)!/(n-l) 5 

pairwise vertex disjoint paths of the form (ai, a 2 , Pi) and the proof of the lemma is complete. □ 



Proof of Theorem [TJ To prove the theorem we employ an argument due to Ajtai et ai [T] 
originally used for n-cubes and independent edge-selection. We proceed along the lines of [3D] and 
select the T(S n , T„)-vertices in two distinct randomizations. 

Let xi,X2 > such that ^ + = 1. First we select with probability 1+e ™/ Xl anc j second with 
probability . The probability of not being chosen in both rounds is given by 

/ _ 1 + e n /xi \ ( _ e n \ > _ 1 + e,i 
\ n J \ X2 ■ n J ~ n ' 

whence it suffices to prove that after the second randomization there exists a giant component 
with the property \c£'\ - |r njfc |. 
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After the first randomization each T(S n , T„)-vertex has been selected with probability 1+€ ^/ Xl and 
according to Lemma [5j we have 

(5.11) |r„ jfc (ari)| ~ p(e„/xi) | Ln(^i)| a.S., 

where T n (xi) C L„. Suppose T n ^(xi) contains a "large" component, S. To be precise a component 
S of size 

(n - 2)! < |5| < (1 - 6) |r„^(x!)|, where 6 > 0. 

Then there exists a split of T n ^(xi), (5, T), satisfying the assumptions of Lemma [JJ (and, of 
course, d(S) n d(T) = 0). We observe that Lemma [3] limits the number of ways these splits can be 
constructed. Let Mk = Li n ^J^n ^ °k n kS+ i for some Ck > 0. Obviously there are at most 2 n ^ Mk 
ways to select S of such a split. Now we employ Lemma [7J In view of (n — 2)! < Lemma [7] 
implies that there exists some c > such that a.s. d(S) is connected to d(T) in T(S n ,T n ) via at 
least c • n!/n 12 < c ■ |S'|/n 10 vertex disjoint paths of length < 3. 

We next perform the second randomization and select T(S n , T„)-vcrticcs with probability £ "^ X2 . 
None of the above c • IS'l/n 10 paths can be selected during this process. Since any two paths are 
vertex disjoint the expected number of such splits is, by linearity of expectation, less than 

(5.12) 2 nl / M * (1 - (e n /x 2 n) 4 )^ < 2" ! / M , e -c'«!/™ 16 for some c, d > 0. 

Accordingly, choosing k sufficiently large the expected number of these r„ j / c (a;i)-splits tends to 
zero, i.e. for any k > fco € N there exists a.s. no two component split (S,T) of T nt k(xi) with the 
property po|r n ,fc(£i)| = \S\ < \T\ . Consequently, there exists some subcomponent C n (xi) with 
the property 

\Cn(%i)\ = |r n ,fe(xi)| ~ p(e n /x 1 ) |r(ari)| a.s., 

obtained by the merging of the subcomponents of size > 0{n kS+ i) generated during the first 
randomization via the paths selected during the second. Since p(e n /xi) is continuous in the 
parameter e n /xi, see eq. ()3.1[) . we derive, for x\ tending to 1 

(5.13) \Cjp\ = lim \C n { Xl )\ ~ p(e n )\T n \ a.s. 

XI — >1 

It remains to prove uniqueness. Any other largest component, C„, is necessarily contained in T n ^k- 
However, we have just proved |C„ | ~ p(e„)|r„| and according to LemmaO p(e„)|r„| ~ |r n ,fe|. 
Therefore \C n \ = o(\C^\), whence C { n 1] is unique. □ 
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